Nearest Descent, In-Tree, and Clustering
نویسندگان
چکیده
Clustering aims at discovering the natural groupings in a dataset, prevalent many disciplines that involve multivariate data analysis. In this paper, we propose physically inspired graph-theoretical clustering method, which first makes points organized into an attractive graph, called In-Tree, via rule, Nearest Descent (ND). The rule of ND works to select nearest node descending direction potential as parent each node, is fundamentally different from classical Gradient Descent. constructed In-Tree proves very good candidate for due its particular features and properties. original problem reduced removing inter-cluster edges graph. Pleasingly, those are usually so distinguishable they can be easily determined by automatic edge-cutting methods. We also visualized strategy validate effectiveness experimental results reveal proposed method superior related characteristics cutting methods meaningfulness increasing reliability practice.
منابع مشابه
Nearest Descent, In-Tree, and Clustering
Clustering analysis is a method to organize raw data into categories based on a measure of similarity. It has been successfully applied to diverse fields from science to business and engineering. By endowing data points with physical meaning like particles in the physical world and then leaning their evolving tendency of moving from higher to lower potentials, data points in the proposed cluste...
متن کاملNonparametric Nearest Neighbor Descent Clustering based on Delaunay Triangulation
In our physically inspired in-tree (IT) based clustering algorithm and the series after it, there is only one free parameter involved in computing the potential value of each point. In this work, based on the Delaunay Triangulation or its dual Voronoi tessellation, we propose a nonparametric process to compute potential values by the local information. This computation, though nonparametric, is...
متن کاملClustering by Deep Nearest Neighbor Descent (D-NND): A Density-based Parameter-Insensitive Clustering Method
Most density-based clustering methods largely rely on how well the underlying density is estimated. However, density estimation itself is also a challenging problem, especially the determination of the kernel bandwidth. A large bandwidth could lead to the over-smoothed density estimation in which the number of density peaks could be less than the true clusters, while a small bandwidth could lea...
متن کاملNearest Neighbor based Greedy Coordinate Descent
Increasingly, optimization problems in machine learning, especially those arising from bigh-dimensional statistical estimation, bave a large number of variables. Modem statistical estimators developed over the past decade have statistical or sample complexity that depends only weakly on the number of parameters when there is some structore to the problem, such as sparsity. A central question is...
متن کاملNearest-neighbors medians clustering
We propose a nonparametric cluster algorithm based on local medians. Each observation is substituted by its local median and this new observation moves toward the peaks and away from the valleys of the distribution. The process is repeated until each observation converges to a fixpoint. We obtain a partition of the sample based on the convergence points. Our algorithm determines the number of c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Mathematics
سال: 2022
ISSN: ['2227-7390']
DOI: https://doi.org/10.3390/math10050764